SQL-to-MapReduce Translation for Efficient OLAP Query Processing with MapReduce
نویسندگان
چکیده
منابع مشابه
Reducing I/O Cost in OLAP Query Processing with MapReduce
This paper presents a method to reduce I/O cost in MapReduce when online analytical processing (OLAP) queries are used for data analysis. The proposed method consists of two basic ideas. First, to reduce network transmission cost, mappers are organized to receive only data necessary to perform a map task, not an entire set of input data. Second, to reduce storage consumption, only record IDs ar...
متن کاملCloud-Aware Processing of MapReduce-Based OLAP Applications
As the volume of data to be processed in a timely manner soars, the scale of computing and storage systems has much trouble keeping up with such a rate of explosive data growth. A hybrid cloud combining two or more clouds is emerging as an appealing alternative to expand local/private systems. However, the effective use of such an expanded cloud system is limited primarily by low network bandwi...
متن کاملRDFPath: Path Query Processing on Large RDF Graphs with MapReduce
The MapReduce programming model has gained traction in different application areas in recent years, ranging from the analysis of log files to the computation of the RDFS closure. Yet, for most users the MapReduce abstraction is too low-level since even simple computations have to be expressed as Map and Reduce phases. In this paper we propose RDFPath, an expressive RDF path query language geare...
متن کاملEfficient Big Data Processing in Hadoop MapReduce
This tutorial is motivated by the clear need of many organizations, companies, and researchers to deal with big data volumes efficiently. Examples include web analytics applications, scientific applications, and social networks. A popular data processing engine for big data is Hadoop MapReduce. Early versions of Hadoop MapReduce suffered from severe performance problems. Today, this is becoming...
متن کاملAeroacoustic post - processing with MapReduce
Present day large-scale computational fluid dynamics simulations can easily produce tens, if not hundreds, of terabytes of useful data. While computational capacity continues to increase according to Moore’s law, the speed of input-output (I/O) to data storage systems has not increased at the same rate. This means that the gap between processing speed and bandwidth to storage systems is increas...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: International Journal of Database Theory and Application
سال: 2017
ISSN: 2005-4270,2005-4270
DOI: 10.14257/ijdta.2017.10.6.05